SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: PLANT GENETIC SYSTEMS N.V. 

(B) STREET: Plateaustraat 22

(C) CITY: Gent 

(E) COUNTRY: Belgium 

(F) POSTAL CODE (ZIP): 9000

(G) TELEPHONE: 32 9 2358454 

(H) TELEFAX: 32 9 224 06 94 

(I) TELEX: 11.361 Pgsgen 

(ii) TITLE OF INVENTION: New Bacillus thuringiensis strains
and their insecticidal proteins

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) SEQUENCE LENGTH: 19 nucleotides

(B) SEQUENCE TYPE: nucleic acid probe

(C) STRANDEDNESS: single stranded

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: synthetic DNA 

(ix) FEATURES: the probe is a part of the coding DNA
strand of the cryIG gene, described by Smulevitch et al.
(1991).

(x) PROPERTIES: this probe is used to isolate the
bTS02618A gene from its containing strain.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

5'-TTCTGTACTATTGATTGTA-3'
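
As a quick consistency check on the probe above, the following Python sketch confirms the stated length of 19 nucleotides and derives the reverse complement, i.e. the strand to which a coding-strand probe would anneal. The names `PROBE`, `reverse_complement` and `gc_fraction` are illustrative assumptions and are not part of the listing.

```python
# Illustrative sketch only; the names below are assumptions, not part of the listing.
PROBE = "TTCTGTACTATTGATTGTA"  # SEQ ID NO: 1, written 5' to 3'

# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    print(len(PROBE))                    # length stated in the listing: 19
    print(reverse_complement(PROBE))     # strand complementary to the probe
    print(round(gc_fraction(PROBE), 3))  # rough indicator of duplex stability
```

The GC fraction is included only because it is a common first-pass indicator for hybridization conditions; the listing itself specifies none.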
(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis
(B) STRAIN: BTS02618A

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(D) OTHER INFORMATION: /function= "contains the translation 
initiation codon of the bTS02618A gene" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



AAAAAGAAAT AGGAATAAAT ACTATCCATT TTTTCAAGAA ATATTTTTTT ATTAGAAAGG 60

AATCTTTCTT ACACGGGAAA ATCCTAAGAT TGAGAGTAAA GATATATATA TATAAATACA 120

ATAAAGAGTT TGTCAGGATT TTTGAAAGAT ATGATATGAA CATGCACTAG ATTTATAGTA 180

TAGGAGGAAA AAGTATGAAT CGAAATAATC AAAATGAATA TGAAATTATT GATGCCCCCC 240

ATTGTGGGTG TCCATCAGAT GACGATGTGA GGTATCCTTT GGCAAGTGAC CCAAATGCAG 300

CGTTACAAAA TATGAACTAT AAAGATTACT TACAAATGAC AGATGAGGAC TACACTGATT 360

CTTATATAAA TCCTAGTTTA TCTATTAGTG GTAGAGATGC AGTTCAGACT GCGCTTACTG 420

TTGTTGGGAG AATACTCGGG GCTTTAGGTG TTCCGTTTTC TGGACAAATA GTGAGTTTTT 480

ATCAATTCCT TTTAAATACA CTGTGGCCAG TTAATGATAC AGCTATATGG GAAGCTTTCA 540

TGCGACAGGT GGAGGAACTT GTCAATCAAC AAATAACAGA ATTTGCAAGA AATCAGGCAC 600

TTGCAAGATT GCAAGGATTA GGAGACTCTT TTAATGTATA TCAACGTTCC CTTCAAAATT 660

GGTTGGCTGA TCGAAATGAT ACACGAAATT TAAGTGTTGT TCGTGCTCAA TTTATAGCTT 720

TAGACCTTGA TTTTGTTAAT GCTATTCCAT TGTTTGCAGT AAATGGACAG CAGGTTCCAT 780

TACTGTCAGT ATATGCACAA GCTGTGAATT TACATTTGTT ATTATTAAAA GATGCATCTC 840

TTTTTGGAGA AGGATGGGGA TTCACACAGG GGGAAATTTC CACATATTAT GACCGTCAAT 900

TGGAACTAAC CGCTAAGTAC ACTAATTACT GTGAAACTTG GTATAATACA GGTTTAGATC 960

GTTTAAGAGG AACAAATACT GAAAGTTGGT TAAGATATCA TCAATTCCGT AGAGAAATGA 1020

CTTTAGTGGT ATTAGATGTT GTGGCGCTAT TTCCATATTA TGATGTACGA CTTTATCCAA 1080

CGGGATCAAA CCCACAGCTT ACACGTGAGG TATATACAGA TCCGATTGTA TTTAATCCAC 1140

CAGCTAATGT TGGACTTTGC CGACGTTGGG GTACTAATCC CTATAATACT TTTTCTGAGC 1200

TCGAAAATGC CTTCATTCGC CCACCACATC TTTTTGATAG GCTGAATAGC TTAACAATCA 1260

GCAGTAATCG ATTTCCAGTT TCATCTAATT TTATGGATTA TTGGTCAGGA CATACGTTAC 1320

GCCGTAGTTA TCTGAACGAT TCAGCAGTAC AAGAAGATAG TTATGGCCTA ATTACAACCA 1380

CAAGAGCAAC AATTAATCCC GGAGTTGATG GAACAAACCG CATAGAGTCA ACGGCAGTAG 1440

ATTTTCGTTC TGCATTGATA GGTATATATG GCGTGAATAG AGCTTCTTTT GTCCCAGGAG 1500

GCTTGTTTAA TGGTACGACT TCTCCTGCTA ATGGAGGATG TAGAGATCTC TATGATACAA 1560

A 1561
(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis

(B) STRAIN: BTS02618A

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1146..1148

(D) OTHER INFORMATION: /function= "Presumed 
stop codon of bTS02618A gene" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAAATTATCC AACATACATT TATCAAAAAG TAGATGCATC GGTGTTAAAG CCTTATACAC 60

GCTATAGACT AGATGGATTT GTGAAGNGTA GTCAAGATTT AGAAATTGAT CTCATCCACC 120

ATCATAAAGT CCATCTTGTA AAAAATGTAC CAGATAATTT AGTATCTGAT ACTTACTCAG 180

ATGGTTCTTG CAGCGGAATC AACCGTTGTG ATGAACAGCA TCAGGTAGAT ATGCAGCTAG 240

ATGCGGAGCA TCATCCAATG GATTGCTGTG AAGCGGCTCA AACACATGAG TTTTCTTCCT 300

ATATTAATAC AGGGGATCTA AATGCAAGTG TAGATCAGGG CATTTGGGTT GTATTAAAAG 360

TTCGAACAAC AGATGGGTAT GCGACGTTAG GAAATCTTGA ATTGGTAGAG GTTGGGCCAT 420

TATCGGGTGA ATCTCTAGAA CGGGAACAAA GAGATAATGC GAAATGGAAT GCAGAGCTAG 480

GAAGAAAACG TGCAGAAATA GATCGTGTGT ATTTAGCTGC GAAACAAGCA ATTAATCATC 540

TGTTTGTAGA CTATCAAGAT CAACAATTAA ATCCAGAAAT TGGGCTAGCA GAAATTAATG 600

AAGCTTCAAA TCTTGTAGAG TCAATTTCGG GTGTATATAG TGATACACTA TTACAGATTC 660

CTGGGATTAA CTACGAAATT TACACAGAGT TATCCGATCG CTTACAACAA GCATCGTATC 720

TGTATACGTC TAGAAATGCG GTGCAAAATG GAGACTTTAA CAGTGGTCTA GATAGTTGGA 780

ATACAACTAT GGATGCATCG GTTCAGCAAG ATGGCAATAT GCATTTCTTA GTTCTTTCGC 840

ATTGGGATGC ACAAGTTTCC CAACAATTGA GAGTAAATCC GAATTGTAAG TATGTCTTAC 900

GTGTGACAGC AAGAAAAGTA GGAGGCGGAG ATGGATACGT CACAATCCGA GATGGCGCTC 960

ATCACCAAGA AACTCTTACA TTTAATGCAT GTGACTACGA TGTAAATGGT ACGTATGTCA 1020

ATGACAATTC GTATATAACA GAAGAAGTGG TATTCTACCC AGAGACAAAA CATATGTGGG 1080

TAGAGGTGAG TGAATCCGAA GGTTCATTCT ATATAGACAG TATTGAGTTT ATTGAAACAC 1140

AAGAGTAGAA GAGGGGGATC CTAACGTATA GCAACTATGA GAGGATACTC CGTACAAACA 1200

AAGATTAAAA AAAGGTAAAA TGAATAGAAC CCCCTACTGG TAGAAGGACC GATAGGGGGT 1260

TCTTACATGA AAAAATGTAG CTGTTTACTA AGGTGTATAA AAAACAGCAT ATCTGATAGA 1320

AAAAAGTGAG TACCTTATAA AGAAAGAATT CCATTCACAG TTTCGGTATC ATATAAATAA 1380

TGATAGGGGT ATCCTTCTTA TTTACATTAT TTTTCGCAAT TATCTCGACG TTCTTCTTTC 1440

CGCTCACAAT GATGATGATC ATGACAACAA TCGCGTCCAT AGCGAACTCT TTCGATATTA 1500

ATAATATCTA AACTCGTGTA GCAGTCATTT CCATTTTTTT TGATCCAGTA AATA 1554
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 668..4141

(D) OTHER INFORMATION: encompasses the entire sequence of SEQ
ID NO 2: from nucleotide position 474 to 2034 in SEQ ID NO 4; also
encompasses part of the sequence of SEQ ID NO 3: from nucleotide
position 2994 to nucleotide position 4344 in SEQ ID NO 4; SEQ ID NO 3
shows additional nucleotides, located downstream (3') from the
sequence shown in SEQ ID NO 4 (nucleotide position 1352 to nucleotide
position 1554 in SEQ ID NO 3)
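
The coordinate relationships stated in this feature entry can be checked with simple arithmetic. The Python sketch below is only an illustration: the function and constant names are assumptions introduced here, while the numeric values are taken from the listing (sequence lengths, the 474 and 2994 start positions, the CDS location 668..4141, and the stop codon location 1146..1148 given for SEQ ID NO 3).

```python
# Illustrative sketch; function and constant names are assumptions, values come from the listing.
SEQ2_LENGTH = 1561
SEQ3_LENGTH = 1554
SEQ4_LENGTH = 4344

SEQ2_OFFSET = 474 - 1   # SEQ ID NO 2 position 1 corresponds to SEQ ID NO 4 position 474
SEQ3_OFFSET = 2994 - 1  # SEQ ID NO 3 position 1 corresponds to SEQ ID NO 4 position 2994

# Number of SEQ ID NO 3 nucleotides that fall inside SEQ ID NO 4.
SEQ3_OVERLAP = SEQ4_LENGTH - SEQ3_OFFSET

CDS_START, CDS_END = 668, 4141


def seq2_to_seq4(pos: int) -> int:
    """Map a 1-based position in SEQ ID NO 2 to its position in SEQ ID NO 4."""
    if not 1 <= pos <= SEQ2_LENGTH:
        raise ValueError("position outside SEQ ID NO 2")
    return pos + SEQ2_OFFSET


def seq3_to_seq4(pos: int) -> int:
    """Map a 1-based position in SEQ ID NO 3 to SEQ ID NO 4 (overlapping part only)."""
    if not 1 <= pos <= SEQ3_OVERLAP:
        raise ValueError("position lies downstream of SEQ ID NO 4")
    return pos + SEQ3_OFFSET


if __name__ == "__main__":
    cds_length = CDS_END - CDS_START + 1
    print(seq2_to_seq4(SEQ2_LENGTH))               # last base of SEQ ID NO 2 in SEQ ID NO 4
    print(SEQ3_OVERLAP, SEQ3_OVERLAP + 1)          # end of overlap, first additional base
    print(seq3_to_seq4(1146), seq3_to_seq4(1148))  # stop codon in SEQ ID NO 4 coordinates
    print(cds_length, cds_length // 3)             # CDS length and codon count, stop included
```

Under these assumptions the stop codon annotated in SEQ ID NO 3 maps onto the last three nucleotides of the CDS in SEQ ID NO 4, and the CDS length is a whole number of codons.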

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCGAGC TCGGTACCTT TTCAGTGTAT CGTTTCCCTT CCATCAGGTT TTCAAATTGA 60 

AAAGCCGAAT GATTTGAAAC TTGTTTACGA TGTAAGTCAT TTGTCTATGA CGAAAGATAC 120 

GTGTAAAAAA CGTATTGAGA TTGATGAATG TGGACAAGTA GAAATTGACT TACAAGTATT 180 

AAAGATTAAG GGTGTCCTTT CTTTTATCGG AAATTTCTCT ATTGAACCTA TTCTGTGTGA 240 

AAACATGTAT ACAACGGTTG ATAGAGATCC GTCTATTTCC TTAAGTTTCC AAGATACGGT 300

ATATGTGGAC CATATTTTAA AATATAGCGT CCAACAACTA CCATATTATG TAATTGATGG 360 

TGATCATATT CAAGTACGTG ATTTACAAAT CAAACTGATG AAAGAGAATC CGCAATCTGC 420 

TCAAGTATCA GGTTTGTTTT GTTTTGTATA TGAGTAAGAA CCGAAGGTTT GTAAAAAAGA 480 

AATAGGAATA AATACTATCC ATTTTTTCAA GAAATATTTT TTTATTAGAA AGGAATCTTT 540

CTTACACGGG AAAATCCTAA GATTGAGAGT AAAGATATAT ATATATAAAT ACAATAAAGA 600 

GTTTGTCAGG ATTTTTGAAA GATATGATAT GAACATGCAC TAGATTTATA GTATAGGAGG 660 

AAAAAGT ATG AAT CGA AAT AAT CAA AAT GAA TAT GAA ATT ATT GAT GCC 709
        Met Asn Arg Asn Asn Gln Asn Glu Tyr Glu Ile Ile Asp Ala
        1               5                   10

CCC CAT TGT GGG TGT CCA TCA GAT GAC GAT GTG AGG TAT CCT TTG GCA 757
Pro His Cys Gly Cys Pro Ser Asp Asp Asp Val Arg Tyr Pro Leu Ala
15                  20                  25                  30

AGT GAC CCA AAT GCA GCG TTA CAA AAT ATG AAC TAT AAA GAT TAC TTA 805
Ser Asp Pro Asn Ala Ala Leu Gln Asn Met Asn Tyr Lys Asp Tyr Leu
                35                  40                  45

CAA ATG ACA GAT GAG GAC TAC ACT GAT TCT TAT ATA AAT CCT AGT TTA 853
Gln Met Thr Asp Glu Asp Tyr Thr Asp Ser Tyr Ile Asn Pro Ser Leu
            50                  55                  60
TCT ATT AGT GGT AGA GAT GCA GTT CAG ACT GCG CTT ACT GTT GTT GGG 901
Ser Ile Ser Gly Arg Asp Ala Val Gln Thr Ala Leu Thr Val Val Gly
        65                  70                  75

AGA ATA CTC GGG GCT TTA GGT GTT CCG TTT TCT GGA CAA ATA GTG AGT 949
Arg Ile Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gln Ile Val Ser
    80                  85                  90

TTT TAT CAA TTC CTT TTA AAT ACA CTG TGG CCA GTT AAT GAT ACA GCT 997
Phe Tyr Gln Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala
95                  100                 105                 110

ATA TGG GAA GCT TTC ATG CGA CAG GTG GAG GAA CTT GTC AAT CAA CAA 1045
Ile Trp Glu Ala Phe Met Arg Gln Val Glu Glu Leu Val Asn Gln Gln
                115                 120                 125

ATA ACA GAA TTT GCA AGA AAT CAG GCA CTT GCA AGA TTG CAA GGA TTA 1093
Ile Thr Glu Phe Ala Arg Asn Gln Ala Leu Ala Arg Leu Gln Gly Leu
            130                 135                 140

GGA GAC TCT TTT AAT GTA TAT CAA CGT TCC CTT CAA AAT TGG TTG GCT 1141
Gly Asp Ser Phe Asn Val Tyr Gln Arg Ser Leu Gln Asn Trp Leu Ala
        145                 150                 155

GAT CGA AAT GAT ACA CGA AAT TTA AGT GTT GTT CGT GCT CAA TTT ATA 1189
Asp Arg Asn Asp Thr Arg Asn Leu Ser Val Val Arg Ala Gln Phe Ile
    160                 165                 170

GCT TTA GAC CTT GAT TTT GTT AAT GCT ATT CCA TTG TTT GCA GTA AAT 1237
Ala Leu Asp Leu Asp Phe Val Asn Ala Ile Pro Leu Phe Ala Val Asn
175                 180                 185                 190

GGA CAG CAG GTT CCA TTA CTG TCA GTA TAT GCA CAA GCT GTG AAT TTA 1285
Gly Gln Gln Val Pro Leu Leu Ser Val Tyr Ala Gln Ala Val Asn Leu
                195                 200                 205

CAT TTG TTA TTA TTA AAA GAT GCA TCT CTT TTT GGA GAA GGA TGG GGA 1333
His Leu Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly
            210                 215                 220

TTC ACA CAG GGG GAA ATT TCC ACA TAT TAT GAC CGT CAA TTG GAA CTA 1381
Phe Thr Gln Gly Glu Ile Ser Thr Tyr Tyr Asp Arg Gln Leu Glu Leu
        225                 230                 235

ACC GCT AAG TAC ACT AAT TAC TGT GAA ACT TGG TAT AAT ACA GGT TTA 1429
Thr Ala Lys Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu
    240                 245                 250

GAT CGT TTA AGA GGA ACA AAT ACT GAA AGT TGG TTA AGA TAT CAT CAA 1477
Asp Arg Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gln
255                 260                 265                 270

TTC CGT AGA GAA ATG ACT TTA GTG GTA TTA GAT GTT GTG GCG CTA TTT 1525
Phe Arg Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe
                275                 280                 285

CCA TAT TAT GAT GTA CGA CTT TAT CCA ACG GGA TCA AAC CCA CAG CTT 1573
Pro Tyr Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gln Leu
            290                 295                 300
ACA CGT GAG GTA TAT ACA GAT CCG ATT GTA TTT AAT CCA CCA GCT AAT 1621
Thr Arg Glu Val Tyr Thr Asp Pro Ile Val Phe Asn Pro Pro Ala Asn
        305                 310                 315

GTT GGA CTT TGC CGA CGT TGG GGT ACT AAT CCC TAT AAT ACT TTT TCT 1669
Val Gly Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser
    320                 325                 330

GAG CTC GAA AAT GCC TTC ATT CGC CCA CCA CAT CTT TTT GAT AGG CTG 1717
Glu Leu Glu Asn Ala Phe Ile Arg Pro Pro His Leu Phe Asp Arg Leu
335                 340                 345                 350

AAT AGC TTA ACA ATC AGC AGT AAT CGA TTT CCA GTT TCA TCT AAT TTT 1765
Asn Ser Leu Thr Ile Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe
                355                 360                 365

ATG GAT TAT TGG TCA GGA CAT ACG TTA CGC CGT AGT TAT CTG AAC GAT 1813
Met Asp Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp
            370                 375                 380

TCA GCA GTA CAA GAA GAT AGT TAT GGC CTA ATT ACA ACC ACA AGA GCA 1861
Ser Ala Val Gln Glu Asp Ser Tyr Gly Leu Ile Thr Thr Thr Arg Ala
        385                 390                 395

ACA ATT AAT CCC GGA GTT GAT GGA ACA AAC CGC ATA GAG TCA ACG GCA 1909
Thr Ile Asn Pro Gly Val Asp Gly Thr Asn Arg Ile Glu Ser Thr Ala
    400                 405                 410

GTA GAT TTT CGT TCT GCA TTG ATA GGT ATA TAT GGC GTG AAT AGA GCT 1957
Val Asp Phe Arg Ser Ala Leu Ile Gly Ile Tyr Gly Val Asn Arg Ala
415                 420                 425                 430

TCT TTT GTC CCA GGA GGC TTG TTT AAT GGT ACG ACT TCT CCT GCT AAT 2005
Ser Phe Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn
                435                 440                 445

GGA GGA TGT AGA GAT CTC TAT GAT ACA AAT GAT GAA TTA CCA CCA GAT 2053
Gly Gly Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp
            450                 455                 460

GAA AGT ACC GGA AGT TCA ACC CAT AGA CTA TCT CAT GTT ACC TTT TTT 2101
Glu Ser Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe
        465                 470                 475

AGC TTT CAA ACT AAT CAG GCT GGA TCT ATA GCT AAT GCA GGA AGT GTA 2149
Ser Phe Gln Thr Asn Gln Ala Gly Ser Ile Ala Asn Ala Gly Ser Val
    480                 485                 490

CCT ACT TAT GTT TGG ACC CGT CGT GAT GTG GAC CTT AAT AAT ACG ATT 2197
Pro Thr Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr Ile
495                 500                 505                 510

ACC CCA AAT AGA ATT ACA CAA TTA CCA TTG GTA AAG GCA TCT GCA CCT 2245
Thr Pro Asn Arg Ile Thr Gln Leu Pro Leu Val Lys Ala Ser Ala Pro
                515                 520                 525

GTT TCG GGT ACT ACG GTC TTA AAA GGT CCA GGA TTT ACA GGA GGG GGT 2293
Val Ser Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly
            530                 535                 540
1T1 rTC CGA AGA ACA ACT AAT GGC ACA TTT GGA ACG TTA AGA GTA ACG 2341
ne  25  Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr
        545                 550                 555

GTT AAT TCA CCA TTA ACA CAA CAA TAT CGC CTA AGA GTT CGT TTT GCC 2389
Val Asn Ser Pro Leu Thr Gln Gln Tyr Arg Leu Arg Val Arg Phe Ala
    560                 565                 570

TCA ACA GGA AAT TTC AGT ATA AGG GTA CTC CGT GGA GGG GTT TCT ATC 2437
Ser Thr Gly Asn Phe Ser Ile Arg Val Leu Arg Gly Gly Val Ser Ile
575                 580                 585                 590

rcr GAT GTT AGA TTA GGG AGC ACA ATG AAC AGA GGG CAG GAA CTA ACT 2485
Ser Asp Val Arg Leu Gly Ser Thr Met Asn Arg Gly Gln Glu Leu Thr
                595                 600                 605

TAC GAA TCC TTT TTC ACA AGA GAG TTT ACT ACT ACT GGT CCG TTC AAT 2533
Tyr Glu Ser Phe Phe Thr Arg Glu Phe Thr Thr Thr Gly Pro Phe Asn
            610                 615                 620

CCG CCT TTT ACA TTT ACA CAA GCT CAA GAG ATT CTA ACA GTG AAT GCA 2581
Pro Pro Phe Thr Phe Thr Gln Ala Gln Glu Ile Leu Thr Val Asn Ala
        625                 630                 635

GAA GGT GTT AGC ACC GGT GGT GAA TAT TAT ATA GAT AGA ATT GAA ATT 2629
Glu Gly Val Ser Thr Gly Gly Glu Tyr Tyr Ile Asp Arg Ile Glu Ile
    640                 645                 650

GTC CCT GTG AAT CCG GCA CGA GAA GCG GAA GAG GAT TTA GAA GCG GCG 2677
Val Pro Val Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala
655                 660                 665                 670

AAG AAA GCG GTG GCG AGC TTG TTT ACA CGT ACA AGG GAC GGA TTA CAG 2725
Lys Lys Ala Val Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln
                675                 680                 685

GTA AAT GTG ACA GAT TAT CAA GTG GAC CAA GCG GCA AAT TTA GTG TCA 2773
Val Asn Val Thr Asp Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser
            690                 695                 700

TGC TTA TCC GAT GAA CAA TAT GGG CAT GAC AAA AAG ATG TTA TTG GAA 2821
Cys Leu Ser Asp Glu Gln Tyr Gly His Asp Lys Lys Met Leu Leu Glu
        705                 710                 715

GCG GTA AGA GCG GCA AAA CGC CTC AGC CGC GAA CGC AAC TTA CTT CAA 2869
Ala Val Arg Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln
    720                 725                 730

GAT CCA GAT TTT AAT ACA ATC AAT AGT ACA GAA GAG AAT GGC TGG AAG 2917
Asp Pro Asp Phe Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys
735                 740                 745                 750

GCA AGT AAC GGT GTT ACT ATT AGC GAG GGC GGT CCA TTC TTT AAA GGT 2965
Ala Ser Asn Gly Val Thr Ile Ser Glu Gly Gly Pro Phe Phe Lys Gly
                755                 760                 765

CGT GCA CTT CAG TTA GCA AGC GCA AGA GAA AAT TAT CCA ACA TAC ATT 3013
Arg Ala Leu Gln Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile
            770                 775                 780
TAT CAA AAA GTA GAT GCA TCG GTG TTA AAG CCT TAT ACA CGC TAT AGA 3061
Tyr Gln Lys Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg
        785                 790                 795

CTA GAT GGA TTT GTG AAG NGT AGT CAA GAT TTA GAA ATT GAT CTC ATC 3109
Leu Asp Gly Phe Val Lys Xaa Ser Gln Asp Leu Glu Ile Asp Leu Ile
    800                 805                 810

CAC CAT CAT AAA GTC CAT CTT GTA AAA AAT GTA CCA GAT AAT TTA GTA 3157
His His His Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val
815                 820                 825                 830

TCT GAT ACT TAC TCA GAT GGT TCT TGC AGC GGA ATC AAC CGT TGT GAT 3205
Ser Asp Thr Tyr Ser Asp Gly Ser Cys Ser Gly Ile Asn Arg Cys Asp
                835                 840                 845

GAA CAG CAT CAG GTA GAT ATG CAG CTA GAT GCG GAG CAT CAT CCA ATG 3253
Glu Gln His Gln Val Asp Met Gln Leu Asp Ala Glu His His Pro Met
            850                 855                 860

GAT TGC TGT GAA GCG GCT CAA ACA CAT GAG TTT TCT TCC TAT ATT AAT 3301
Asp Cys Cys Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asn
        865                 870                 875

ACA GGG GAT CTA AAT GCA AGT GTA GAT CAG GGC ATT TGG GTT GTA TTA 3349
Thr Gly Asp Leu Asn Ala Ser Val Asp Gln Gly Ile Trp Val Val Leu
    880                 885                 890

AAA GTT CGA ACA ACA GAT GGG TAT GCG ACG TTA GGA AAT CTT GAA TTG 3397
Lys Val Arg Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu
895                 900                 905                 910

GTA GAG GTT GGG CCA TTA TCG GGT GAA TCT CTA GAA CGG GAA CAA AGA 3445
Val Glu Val Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg
                915                 920                 925

GAT AAT GCG AAA TGG AAT GCA GAG CTA GGA AGA AAA CGT GCA GAA ATA 3493
Asp Asn Ala Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu Ile
            930                 935                 940

GAT CGT GTG TAT TTA GCT GCG AAA CAA GCA ATT AAT CAT CTG TTT GTA 3541
Asp Arg Val Tyr Leu Ala Ala Lys Gln Ala Ile Asn His Leu Phe Val
        945                 950                 955

GAC TAT CAA GAT CAA CAA TTA AAT CCA GAA ATT GGG CTA GCA GAA ATT 3589
Asp Tyr Gln Asp Gln Gln Leu Asn Pro Glu Ile Gly Leu Ala Glu Ile
    960                 965                 970

AAT GAA GCT TCA AAT CTT GTA GAG TCA ATT TCG GGT GTA TAT AGT GAT 3637
Asn Glu Ala Ser Asn Leu Val Glu Ser Ile Ser Gly Val Tyr Ser Asp
975                 980                 985                 990

ACA CTA TTA CAG ATT CCT GGG ATT AAC TAC GAA ATT TAC ACA GAG TTA 3685
Thr Leu Leu Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu
                995                 1000                1005

TCC GAT CGC TTA CAA CAA GCA TCG TAT CTG TAT ACG TCT AGA AAT GCG 3733
Ser Asp Arg Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala
            1010                1015                1020
GTG CAA AAT GGA GAC TTT AAC AGT GGT CTA GAT AGT TGG AAT ACA ACT 3781
Val Gln Asn Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr
        1025                1030                1035

ATG GAT GCA TCG GTT CAG CAA GAT GGC AAT ATG CAT TTC TTA GTT CTT 3829
Met Asp Ala Ser Val Gln Gln Asp Gly Asn Met His Phe Leu Val Leu
    1040                1045                1050

TCG CAT TGG GAT GCA CAA GTT TCC CAA CAA TTG AGA GTA AAT CCG AAT 3877
Ser His Trp Asp Ala Gln Val Ser Gln Gln Leu Arg Val Asn Pro Asn
1055                1060                1065                1070

TGT AAG TAT GTC TTA CGT GTG ACA GCA AGA AAA GTA GGA GGC GGA GAT 3925
Cys Lys Tyr Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp
                1075                1080                1085

GGA TAC GTC ACA ATC CGA GAT GGC GCT CAT CAC CAA GAA ACT CTT ACA 3973
Gly Tyr Val Thr Ile Arg Asp Gly Ala His His Gln Glu Thr Leu Thr
            1090                1095                1100

TTT AAT GCA TGT GAC TAC GAT GTA AAT GGT ACG TAT GTC AAT GAC AAT 4021
Phe Asn Ala Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Asn
        1105                1110                1115

TCG TAT ATA ACA GAA GAA GTG GTA TTC TAC CCA GAG ACA AAA CAT ATG 4069
Ser Tyr Ile Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met
    1120                1125                1130

TGG GTA GAG GTG AGT GAA TCC GAA GGT TCA TTC TAT ATA GAC AGT ATT 4117
Trp Val Glu Val Ser Glu Ser Glu Gly Ser Phe Tyr Ile Asp Ser Ile
1135                1140                1145                1150

GAG TTT ATT GAA ACA CAA GAG TAGAAGAGGG GGATCCTAAC GTATAGCAAC 4168
Glu Phe Ile Glu Thr Gln Glu
                1155

TATGAGAGGA TACTCCGTAC AAACAAAGAT TAAAAAAAGG TAAAATGAAT AGAACCCCCT 4228 

ACTGGTAGAA GGACCGATAG GGGGTTCTTA CATGAAAAAA TGTAGCTGTT TACTAAGGTG 4288 

TATAAAAAAC AGCATATCTG ATAGAAAAAA GTGAGTACCT TATAAAGAAA GAATTC 4344 
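
The codon and residue pairing shown for SEQ ID NO: 4 can be spot-checked with the standard genetic code. The sketch below is a minimal illustration, assuming the standard code applies; `translate` and the table names are hypothetical helpers, not part of the listing. It translates the first codon line of the CDS (ending at position 709) and the last one (ending in the TAG stop codon).

```python
# Illustrative sketch; helper names are assumptions. Uses the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon table from the conventional TCAG ordering of the three positions.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter names, matching the notation used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> list:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":  # stop codon: end of the open reading frame
            break
        residues.append(THREE_LETTER[amino])
    return residues


if __name__ == "__main__":
    first_line = "ATG AAT CGA AAT AAT CAA AAT GAA TAT GAA ATT ATT GAT GCC"
    last_line = "GAG TTT ATT GAA ACA CAA GAG TAG"
    print(" ".join(translate(first_line)))
    print(" ".join(translate(last_line)))
```

Codons containing an ambiguous or illegible base are not handled by this sketch and would raise a `KeyError`.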



